Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 100000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 18.3 MiB |
| Average record size in memory | 192.0 B |
Variable types
| Categorical | 13 |
|---|---|
| Text | 1 |
| Numeric | 9 |
| DateTime | 1 |
article.1 is highly overall correlated with category and 11 other fields | High correlation |
category is highly overall correlated with article.1 and 11 other fields | High correlation |
cost is highly overall correlated with article.1 and 9 other fields | High correlation |
country is highly overall correlated with customer_id | High correlation |
current_price is highly overall correlated with regular_price | High correlation |
customer_id is highly overall correlated with country | High correlation |
gender is highly overall correlated with article.1 and 8 other fields | High correlation |
productgroup is highly overall correlated with article.1 and 8 other fields | High correlation |
regular_price is highly overall correlated with current_price | High correlation |
rgb_b_main_col is highly overall correlated with article.1 and 9 other fields | High correlation |
rgb_b_sec_col is highly overall correlated with article.1 and 9 other fields | High correlation |
rgb_g_main_col is highly overall correlated with article.1 and 9 other fields | High correlation |
rgb_g_sec_col is highly overall correlated with article.1 and 9 other fields | High correlation |
rgb_r_main_col is highly overall correlated with article.1 and 7 other fields | High correlation |
rgb_r_sec_col is highly overall correlated with article.1 and 9 other fields | High correlation |
sizes is highly overall correlated with article.1 and 6 other fields | High correlation |
style is highly overall correlated with article.1 and 6 other fields | High correlation |
promo1 is highly imbalanced (66.5%) | Imbalance |
promo2 is highly imbalanced (95.5%) | Imbalance |
sizes is highly imbalanced (53.1%) | Imbalance |
article.1 is uniformly distributed | Uniform |
rgb_b_main_col has 10000 (10.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-05-08 09:30:14.090401 |
|---|---|
| Analysis finished | 2025-05-08 09:30:28.615095 |
| Duration | 14.52 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
country
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| Germany | |
|---|---|
| Austria | |
| France |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.8454 |
| Min length | 6 |
Characters and Unicode
| Total characters | 684540 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Germany |
|---|---|
| 2nd row | Germany |
| 3rd row | Germany |
| 4th row | Germany |
| 5th row | Germany |
Common Values
| Value | Count | Frequency (%) |
| Germany | 49400 | |
| Austria | 35140 | |
| France | 15460 | 15.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| germany | 49400 | |
| austria | 35140 | |
| france | 15460 | 15.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 100000 | |
| a | 100000 | |
| e | 64860 | |
| n | 64860 | |
| G | 49400 | |
| m | 49400 | |
| y | 49400 | |
| A | 35140 | 5.1% |
| u | 35140 | 5.1% |
| s | 35140 | 5.1% |
| Other values (4) | 101200 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 684540 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 100000 | |
| a | 100000 | |
| e | 64860 | |
| n | 64860 | |
| G | 49400 | |
| m | 49400 | |
| y | 49400 | |
| A | 35140 | 5.1% |
| u | 35140 | 5.1% |
| s | 35140 | 5.1% |
| Other values (4) | 101200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 684540 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 100000 | |
| a | 100000 | |
| e | 64860 | |
| n | 64860 | |
| G | 49400 | |
| m | 49400 | |
| y | 49400 | |
| A | 35140 | 5.1% |
| u | 35140 | 5.1% |
| s | 35140 | 5.1% |
| Other values (4) | 101200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 684540 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 100000 | |
| a | 100000 | |
| e | 64860 | |
| n | 64860 | |
| G | 49400 | |
| m | 49400 | |
| y | 49400 | |
| A | 35140 | 5.1% |
| u | 35140 | 5.1% |
| s | 35140 | 5.1% |
| Other values (4) | 101200 |
article
Text
| Distinct | 477 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 600000 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | YN8639 |
|---|---|
| 2nd row | YN8639 |
| 3rd row | YN8639 |
| 4th row | YN8639 |
| 5th row | YN8639 |
| Value | Count | Frequency (%) |
| br3179 | 610 | 0.6% |
| mr4948 | 560 | 0.6% |
| xg6449 | 550 | 0.5% |
| aa7884 | 540 | 0.5% |
| op1184 | 520 | 0.5% |
| vs6613 | 510 | 0.5% |
| qs5396 | 510 | 0.5% |
| cb4942 | 510 | 0.5% |
| st3419 | 490 | 0.5% |
| ze9366 | 480 | 0.5% |
| Other values (467) | 94720 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 49480 | 8.2% |
| 6 | 47430 | 7.9% |
| 7 | 46260 | 7.7% |
| 2 | 45710 | 7.6% |
| 4 | 44500 | 7.4% |
| 1 | 43690 | 7.3% |
| 3 | 42760 | 7.1% |
| 9 | 41580 | 6.9% |
| 5 | 38590 | 6.4% |
| X | 10380 | 1.7% |
| Other values (25) | 189620 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 600000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 8 | 49480 | 8.2% |
| 6 | 47430 | 7.9% |
| 7 | 46260 | 7.7% |
| 2 | 45710 | 7.6% |
| 4 | 44500 | 7.4% |
| 1 | 43690 | 7.3% |
| 3 | 42760 | 7.1% |
| 9 | 41580 | 6.9% |
| 5 | 38590 | 6.4% |
| X | 10380 | 1.7% |
| Other values (25) | 189620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 600000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 8 | 49480 | 8.2% |
| 6 | 47430 | 7.9% |
| 7 | 46260 | 7.7% |
| 2 | 45710 | 7.6% |
| 4 | 44500 | 7.4% |
| 1 | 43690 | 7.3% |
| 3 | 42760 | 7.1% |
| 9 | 41580 | 6.9% |
| 5 | 38590 | 6.4% |
| X | 10380 | 1.7% |
| Other values (25) | 189620 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 600000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 8 | 49480 | 8.2% |
| 6 | 47430 | 7.9% |
| 7 | 46260 | 7.7% |
| 2 | 45710 | 7.6% |
| 4 | 44500 | 7.4% |
| 1 | 43690 | 7.3% |
| 3 | 42760 | 7.1% |
| 9 | 41580 | 6.9% |
| 5 | 38590 | 6.4% |
| X | 10380 | 1.7% |
| Other values (25) | 189620 |
sales
Real number (ℝ)
| Distinct | 476 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56.7818 |
| Minimum | 1 |
|---|---|
| Maximum | 898 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 10 |
| median | 26 |
| Q3 | 64 |
| 95-th percentile | 216 |
| Maximum | 898 |
| Range | 897 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 87.934743 |
|---|---|
| Coefficient of variation (CV) | 1.5486431 |
| Kurtosis | 20.657374 |
| Mean | 56.7818 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 3.8588957 |
| Sum | 5678180 |
| Variance | 7732.5191 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 3080 | 3.1% |
| 1 | 3060 | 3.1% |
| 3 | 2950 | 2.9% |
| 4 | 2800 | 2.8% |
| 5 | 2680 | 2.7% |
| 6 | 2670 | 2.7% |
| 8 | 2380 | 2.4% |
| 7 | 2380 | 2.4% |
| 9 | 2160 | 2.2% |
| 11 | 2130 | 2.1% |
| Other values (466) | 73710 |
| Value | Count | Frequency (%) |
| 1 | 3060 | |
| 2 | 3080 | |
| 3 | 2950 | |
| 4 | 2800 | |
| 5 | 2680 | |
| 6 | 2670 | |
| 7 | 2380 | |
| 8 | 2380 | |
| 9 | 2160 | |
| 10 | 1940 |
| Value | Count | Frequency (%) |
| 898 | 10 | |
| 883 | 10 | |
| 881 | 10 | |
| 852 | 10 | |
| 841 | 10 | |
| 827 | 10 | |
| 819 | 10 | |
| 818 | 20 | |
| 797 | 10 | |
| 796 | 10 |
regular_price
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 123 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.3912 |
| Minimum | 3.95 |
|---|---|
| Maximum | 197.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 3.95 |
|---|---|
| 5-th percentile | 6.95 |
| Q1 | 25.95 |
| median | 40.95 |
| Q3 | 79.95 |
| 95-th percentile | 120.95 |
| Maximum | 197.95 |
| Range | 194 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 35.272128 |
|---|---|
| Coefficient of variation (CV) | 0.67324527 |
| Kurtosis | 0.32235243 |
| Mean | 52.3912 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.90371157 |
| Sum | 5239120 |
| Variance | 1244.123 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26.95 | 3620 | 3.6% |
| 29.95 | 3160 | 3.2% |
| 30.95 | 3120 | 3.1% |
| 23.95 | 3110 | 3.1% |
| 62.95 | 2690 | 2.7% |
| 25.95 | 2540 | 2.5% |
| 44.95 | 2420 | 2.4% |
| 20.95 | 2330 | 2.3% |
| 3.95 | 2090 | 2.1% |
| 83.95 | 1920 | 1.9% |
| Other values (113) | 73000 |
| Value | Count | Frequency (%) |
| 3.95 | 2090 | |
| 4.95 | 570 | 0.6% |
| 5.95 | 1270 | |
| 6.95 | 1330 | |
| 7.95 | 170 | 0.2% |
| 8.95 | 680 | 0.7% |
| 9.95 | 510 | 0.5% |
| 10.95 | 800 | 0.8% |
| 11.95 | 130 | 0.1% |
| 12.95 | 190 | 0.2% |
| Value | Count | Frequency (%) |
| 197.95 | 120 | 0.1% |
| 195.95 | 160 | 0.2% |
| 153.95 | 850 | |
| 150.95 | 150 | 0.1% |
| 141.95 | 90 | 0.1% |
| 139.95 | 240 | 0.2% |
| 136.95 | 200 | 0.2% |
| 135.95 | 270 | 0.3% |
| 134.95 | 150 | 0.1% |
| 132.95 | 490 |
current_price
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 141 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.2908 |
| Minimum | 1.95 |
|---|---|
| Maximum | 195.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 1.95 |
|---|---|
| 5-th percentile | 3.95 |
| Q1 | 11.95 |
| median | 20.95 |
| Q3 | 37.95 |
| 95-th percentile | 74.95 |
| Maximum | 195.95 |
| Range | 194 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 22.578343 |
|---|---|
| Coefficient of variation (CV) | 0.79808074 |
| Kurtosis | 2.9168272 |
| Mean | 28.2908 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 1.5474818 |
| Sum | 2829080 |
| Variance | 509.78155 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.95 | 3660 | 3.7% |
| 9.95 | 3360 | 3.4% |
| 11.95 | 3230 | 3.2% |
| 13.95 | 3130 | 3.1% |
| 17.95 | 2930 | 2.9% |
| 12.95 | 2920 | 2.9% |
| 16.95 | 2890 | 2.9% |
| 15.95 | 2720 | 2.7% |
| 7.95 | 2670 | 2.7% |
| 14.95 | 2520 | 2.5% |
| Other values (131) | 69970 |
| Value | Count | Frequency (%) |
| 1.95 | 1730 | |
| 2.95 | 1990 | |
| 3.95 | 1420 | 1.4% |
| 4.95 | 1650 | |
| 5.95 | 1960 | |
| 6.95 | 2160 | |
| 7.95 | 2670 | |
| 8.95 | 3660 | |
| 9.95 | 3360 | |
| 10.95 | 2400 |
| Value | Count | Frequency (%) |
| 195.95 | 10 | |
| 178.95 | 10 | |
| 154.95 | 10 | |
| 152.95 | 20 | |
| 145.95 | 20 | |
| 144.95 | 10 | |
| 141.95 | 10 | |
| 140.95 | 10 | |
| 136.95 | 10 | |
| 135.95 | 10 |
ratio
Real number (ℝ)
| Distinct | 2722 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.54564586 |
| Minimum | 0.29648241 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 0.29648241 |
|---|---|
| 5-th percentile | 0.30283224 |
| Q1 | 0.35483871 |
| median | 0.52504358 |
| Q3 | 0.69924812 |
| 95-th percentile | 0.88868275 |
| Maximum | 1 |
| Range | 0.70351759 |
| Interquartile range (IQR) | 0.34440941 |
Descriptive statistics
| Standard deviation | 0.19436278 |
|---|---|
| Coefficient of variation (CV) | 0.35620682 |
| Kurtosis | -0.9113374 |
| Mean | 0.54564586 |
| Median Absolute Deviation (MAD) | 0.17124714 |
| Skewness | 0.39778993 |
| Sum | 54564.586 |
| Variance | 0.03777689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1490 | 1.5% |
| 0.4936708861 | 1140 | 1.1% |
| 0.3103448276 | 830 | 0.8% |
| 0.332096475 | 820 | 0.8% |
| 0.3214862682 | 760 | 0.8% |
| 0.2988313856 | 730 | 0.7% |
| 0.746835443 | 720 | 0.7% |
| 0.3319415449 | 690 | 0.7% |
| 0.3317422434 | 570 | 0.6% |
| 0.3548387097 | 510 | 0.5% |
| Other values (2712) | 91740 |
| Value | Count | Frequency (%) |
| 0.2964824121 | 120 | 0.1% |
| 0.298245614 | 230 | 0.2% |
| 0.2988313856 | 730 | |
| 0.2991239049 | 140 | 0.1% |
| 0.2992992993 | 160 | 0.2% |
| 0.2994161802 | 40 | < 0.1% |
| 0.2994996426 | 310 | |
| 0.2995622264 | 50 | 0.1% |
| 0.2996108949 | 60 | 0.1% |
| 0.2996498249 | 70 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1490 | |
| 0.9920603414 | 10 | < 0.1% |
| 0.9917321207 | 10 | < 0.1% |
| 0.9915218313 | 10 | < 0.1% |
| 0.9904716532 | 10 | < 0.1% |
| 0.9899949975 | 10 | < 0.1% |
| 0.9890049478 | 10 | < 0.1% |
| 0.9880881477 | 10 | < 0.1% |
| 0.9857040743 | 20 | < 0.1% |
| 0.9841206828 | 10 | < 0.1% |
retailweek
Date
| Distinct | 123 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| Minimum | 2014-12-28 00:00:00 |
|---|---|
| Maximum | 2017-04-30 00:00:00 |
promo1
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| 0 | |
|---|---|
| 1 | 6190 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 100000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 93810 | |
| 1 | 6190 | 6.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 93810 | |
| 1 | 6190 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 93810 | |
| 1 | 6190 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 93810 | |
| 1 | 6190 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 93810 | |
| 1 | 6190 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 93810 | |
| 1 | 6190 | 6.2% |
promo2
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| 0 | |
|---|---|
| 1 | 490 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 100000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 99510 | |
| 1 | 490 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 99510 | |
| 1 | 490 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 99510 | |
| 1 | 490 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 99510 | |
| 1 | 490 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 99510 | |
| 1 | 490 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 99510 | |
| 1 | 490 | 0.5% |
customer_id
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 4549 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2721.7265 |
| Minimum | 1 |
|---|---|
| Maximum | 5999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 203 |
| Q1 | 1017 |
| median | 2091 |
| Q3 | 4570.25 |
| 95-th percentile | 5721.05 |
| Maximum | 5999 |
| Range | 5998 |
| Interquartile range (IQR) | 3553.25 |
Descriptive statistics
| Standard deviation | 1908.0855 |
|---|---|
| Coefficient of variation (CV) | 0.70105703 |
| Kurtosis | -1.4331178 |
| Mean | 2721.7265 |
| Median Absolute Deviation (MAD) | 1592 |
| Skewness | 0.24385097 |
| Sum | 2.7217265 × 108 |
| Variance | 3640790.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1692 | 80 | 0.1% |
| 1264 | 80 | 0.1% |
| 1111 | 80 | 0.1% |
| 1240 | 70 | 0.1% |
| 22 | 70 | 0.1% |
| 1726 | 70 | 0.1% |
| 1586 | 70 | 0.1% |
| 2328 | 70 | 0.1% |
| 5890 | 70 | 0.1% |
| 282 | 70 | 0.1% |
| Other values (4539) | 99270 |
| Value | Count | Frequency (%) |
| 1 | 10 | < 0.1% |
| 2 | 10 | < 0.1% |
| 3 | 40 | |
| 4 | 30 | |
| 5 | 30 | |
| 7 | 10 | < 0.1% |
| 8 | 20 | |
| 9 | 10 | < 0.1% |
| 10 | 10 | < 0.1% |
| 11 | 30 |
| Value | Count | Frequency (%) |
| 5999 | 20 | |
| 5998 | 10 | < 0.1% |
| 5997 | 10 | < 0.1% |
| 5996 | 20 | |
| 5995 | 40 | |
| 5994 | 30 | |
| 5992 | 30 | |
| 5991 | 20 | |
| 5990 | 10 | < 0.1% |
| 5989 | 20 |
article.1
Categorical
HIGH CORRELATION  UNIFORM 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| OC6355 | |
|---|---|
| AP5568 | |
| CB8861 | |
| LI3529 | |
| GG8661 | |
| Other values (5) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 600000 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OC6355 |
|---|---|
| 2nd row | AP5568 |
| 3rd row | CB8861 |
| 4th row | LI3529 |
| 5th row | GG8661 |
Common Values
| Value | Count | Frequency (%) |
| OC6355 | 10000 | |
| AP5568 | 10000 | |
| CB8861 | 10000 | |
| LI3529 | 10000 | |
| GG8661 | 10000 | |
| TX1463 | 10000 | |
| PC6383 | 10000 | |
| VT7698 | 10000 | |
| FG2965 | 10000 | |
| AC7347 | 10000 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| oc6355 | 10000 | |
| ap5568 | 10000 | |
| cb8861 | 10000 | |
| li3529 | 10000 | |
| gg8661 | 10000 | |
| tx1463 | 10000 | |
| pc6383 | 10000 | |
| vt7698 | 10000 | |
| fg2965 | 10000 | |
| ac7347 | 10000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 90000 | |
| 3 | 60000 | 10.0% |
| 5 | 60000 | 10.0% |
| 8 | 60000 | 10.0% |
| C | 40000 | 6.7% |
| 7 | 30000 | 5.0% |
| 1 | 30000 | 5.0% |
| 9 | 30000 | 5.0% |
| G | 30000 | 5.0% |
| 4 | 20000 | 3.3% |
| Other values (11) | 150000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 600000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 6 | 90000 | |
| 3 | 60000 | 10.0% |
| 5 | 60000 | 10.0% |
| 8 | 60000 | 10.0% |
| C | 40000 | 6.7% |
| 7 | 30000 | 5.0% |
| 1 | 30000 | 5.0% |
| 9 | 30000 | 5.0% |
| G | 30000 | 5.0% |
| 4 | 20000 | 3.3% |
| Other values (11) | 150000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 600000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 6 | 90000 | |
| 3 | 60000 | 10.0% |
| 5 | 60000 | 10.0% |
| 8 | 60000 | 10.0% |
| C | 40000 | 6.7% |
| 7 | 30000 | 5.0% |
| 1 | 30000 | 5.0% |
| 9 | 30000 | 5.0% |
| G | 30000 | 5.0% |
| 4 | 20000 | 3.3% |
| Other values (11) | 150000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 600000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 6 | 90000 | |
| 3 | 60000 | 10.0% |
| 5 | 60000 | 10.0% |
| 8 | 60000 | 10.0% |
| C | 40000 | 6.7% |
| 7 | 30000 | 5.0% |
| 1 | 30000 | 5.0% |
| 9 | 30000 | 5.0% |
| G | 30000 | 5.0% |
| 4 | 20000 | 3.3% |
| Other values (11) | 150000 |
productgroup
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| SHOES | |
|---|---|
| HARDWARE ACCESSORIES | |
| SHORTS | |
| SWEATSHIRTS |
Length
| Max length | 20 |
|---|---|
| Median length | 5 |
| Mean length | 8.7 |
| Min length | 5 |
Characters and Unicode
| Total characters | 870000 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SHOES |
|---|---|
| 2nd row | SHORTS |
| 3rd row | HARDWARE ACCESSORIES |
| 4th row | SHOES |
| 5th row | SHOES |
Common Values
| Value | Count | Frequency (%) |
| SHOES | 60000 | |
| HARDWARE ACCESSORIES | 20000 | 20.0% |
| SHORTS | 10000 | 10.0% |
| SWEATSHIRTS | 10000 | 10.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| shoes | 60000 | |
| hardware | 20000 | 16.7% |
| accessories | 20000 | 16.7% |
| shorts | 10000 | 8.3% |
| sweatshirts | 10000 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 230000 | |
| E | 130000 | |
| H | 100000 | |
| O | 90000 | 10.3% |
| R | 80000 | 9.2% |
| A | 70000 | 8.0% |
| C | 40000 | 4.6% |
| W | 30000 | 3.4% |
| I | 30000 | 3.4% |
| T | 30000 | 3.4% |
| Other values (2) | 40000 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 870000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 230000 | |
| E | 130000 | |
| H | 100000 | |
| O | 90000 | 10.3% |
| R | 80000 | 9.2% |
| A | 70000 | 8.0% |
| C | 40000 | 4.6% |
| W | 30000 | 3.4% |
| I | 30000 | 3.4% |
| T | 30000 | 3.4% |
| Other values (2) | 40000 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 870000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 230000 | |
| E | 130000 | |
| H | 100000 | |
| O | 90000 | 10.3% |
| R | 80000 | 9.2% |
| A | 70000 | 8.0% |
| C | 40000 | 4.6% |
| W | 30000 | 3.4% |
| I | 30000 | 3.4% |
| T | 30000 | 3.4% |
| Other values (2) | 40000 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 870000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 230000 | |
| E | 130000 | |
| H | 100000 | |
| O | 90000 | 10.3% |
| R | 80000 | 9.2% |
| A | 70000 | 8.0% |
| C | 40000 | 4.6% |
| W | 30000 | 3.4% |
| I | 30000 | 3.4% |
| T | 30000 | 3.4% |
| Other values (2) | 40000 | 4.6% |
category
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| TRAINING | |
|---|---|
| RUNNING | |
| FOOTBALL GENERIC | |
| GOLF | |
| RELAX CASUAL |
Length
| Max length | 16 |
|---|---|
| Median length | 10 |
| Mean length | 9.2 |
| Min length | 4 |
Characters and Unicode
| Total characters | 920000 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TRAINING |
|---|---|
| 2nd row | TRAINING |
| 3rd row | GOLF |
| 4th row | RUNNING |
| 5th row | RELAX CASUAL |
Common Values
| Value | Count | Frequency (%) |
| TRAINING | 30000 | |
| RUNNING | 20000 | |
| FOOTBALL GENERIC | 20000 | |
| GOLF | 10000 | 10.0% |
| RELAX CASUAL | 10000 | 10.0% |
| INDOOR | 10000 | 10.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| training | 30000 | |
| running | 20000 | |
| football | 20000 | |
| generic | 20000 | |
| golf | 10000 | 7.7% |
| relax | 10000 | 7.7% |
| casual | 10000 | 7.7% |
| indoor | 10000 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 150000 | |
| I | 110000 | |
| R | 90000 | |
| A | 80000 | |
| G | 80000 | |
| O | 70000 | |
| L | 70000 | |
| E | 50000 | 5.4% |
| T | 50000 | 5.4% |
| F | 30000 | 3.3% |
| Other values (7) | 140000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 920000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 150000 | |
| I | 110000 | |
| R | 90000 | |
| A | 80000 | |
| G | 80000 | |
| O | 70000 | |
| L | 70000 | |
| E | 50000 | 5.4% |
| T | 50000 | 5.4% |
| F | 30000 | 3.3% |
| Other values (7) | 140000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 920000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 150000 | |
| I | 110000 | |
| R | 90000 | |
| A | 80000 | |
| G | 80000 | |
| O | 70000 | |
| L | 70000 | |
| E | 50000 | 5.4% |
| T | 50000 | 5.4% |
| F | 30000 | 3.3% |
| Other values (7) | 140000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 920000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 150000 | |
| I | 110000 | |
| R | 90000 | |
| A | 80000 | |
| G | 80000 | |
| O | 70000 | |
| L | 70000 | |
| E | 50000 | 5.4% |
| T | 50000 | 5.4% |
| F | 30000 | 3.3% |
| Other values (7) | 140000 |
cost
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.517 |
| Minimum | 1.29 |
|---|---|
| Maximum | 13.29 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 1.29 |
|---|---|
| 5-th percentile | 1.29 |
| Q1 | 2.29 |
| median | 6.95 |
| Q3 | 9.6 |
| 95-th percentile | 13.29 |
| Maximum | 13.29 |
| Range | 12 |
| Interquartile range (IQR) | 7.31 |
Descriptive statistics
| Standard deviation | 3.9147279 |
|---|---|
| Coefficient of variation (CV) | 0.60069478 |
| Kurtosis | -1.2872918 |
| Mean | 6.517 |
| Median Absolute Deviation (MAD) | 2.85 |
| Skewness | 0.099353368 |
| Sum | 651700 |
| Variance | 15.325094 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13.29 | 10000 | |
| 2.29 | 10000 | |
| 1.7 | 10000 | |
| 9 | 10000 | |
| 9.6 | 10000 | |
| 4.2 | 10000 | |
| 9.9 | 10000 | |
| 5.2 | 10000 | |
| 1.29 | 10000 | |
| 8.7 | 10000 |
| Value | Count | Frequency (%) |
| 1.29 | 10000 | |
| 1.7 | 10000 | |
| 2.29 | 10000 | |
| 4.2 | 10000 | |
| 5.2 | 10000 | |
| 8.7 | 10000 | |
| 9 | 10000 | |
| 9.6 | 10000 | |
| 9.9 | 10000 | |
| 13.29 | 10000 |
| Value | Count | Frequency (%) |
| 13.29 | 10000 | |
| 9.9 | 10000 | |
| 9.6 | 10000 | |
| 9 | 10000 | |
| 8.7 | 10000 | |
| 5.2 | 10000 | |
| 4.2 | 10000 | |
| 2.29 | 10000 | |
| 1.7 | 10000 | |
| 1.29 | 10000 |
style
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| regular | |
|---|---|
| wide | |
| slim |
Length
| Max length | 7 |
|---|---|
| Median length | 5.5 |
| Mean length | 5.5 |
| Min length | 4 |
Characters and Unicode
| Total characters | 550000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | slim |
|---|---|
| 2nd row | regular |
| 3rd row | regular |
| 4th row | regular |
| 5th row | regular |
Common Values
| Value | Count | Frequency (%) |
| regular | 50000 | |
| wide | 30000 | |
| slim | 20000 | 20.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| regular | 50000 | |
| wide | 30000 | |
| slim | 20000 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 100000 | |
| e | 80000 | |
| l | 70000 | |
| g | 50000 | |
| u | 50000 | |
| a | 50000 | |
| i | 50000 | |
| w | 30000 | 5.5% |
| d | 30000 | 5.5% |
| s | 20000 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 550000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 100000 | |
| e | 80000 | |
| l | 70000 | |
| g | 50000 | |
| u | 50000 | |
| a | 50000 | |
| i | 50000 | |
| w | 30000 | 5.5% |
| d | 30000 | 5.5% |
| s | 20000 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 550000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 100000 | |
| e | 80000 | |
| l | 70000 | |
| g | 50000 | |
| u | 50000 | |
| a | 50000 | |
| i | 50000 | |
| w | 30000 | 5.5% |
| d | 30000 | 5.5% |
| s | 20000 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 550000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 100000 | |
| e | 80000 | |
| l | 70000 | |
| g | 50000 | |
| u | 50000 | |
| a | 50000 | |
| i | 50000 | |
| w | 30000 | 5.5% |
| d | 30000 | 5.5% |
| s | 20000 | 3.6% |
sizes
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| xxs,xs,s,m,l,xl,xxl | |
|---|---|
| xs,s,m,l,xl |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 18.2 |
| Min length | 11 |
Characters and Unicode
| Total characters | 1820000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | xxs,xs,s,m,l,xl,xxl |
|---|---|
| 2nd row | xxs,xs,s,m,l,xl,xxl |
| 3rd row | xxs,xs,s,m,l,xl,xxl |
| 4th row | xxs,xs,s,m,l,xl,xxl |
| 5th row | xxs,xs,s,m,l,xl,xxl |
Common Values
| Value | Count | Frequency (%) |
| xxs,xs,s,m,l,xl,xxl | 90000 | |
| xs,s,m,l,xl | 10000 | 10.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| xxs,xs,s,m,l,xl,xxl | 90000 | |
| xs,s,m,l,xl | 10000 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| , | 580000 | |
| x | 560000 | |
| s | 290000 | |
| l | 290000 | |
| m | 100000 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1820000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| , | 580000 | |
| x | 560000 | |
| s | 290000 | |
| l | 290000 | |
| m | 100000 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1820000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| , | 580000 | |
| x | 560000 | |
| s | 290000 | |
| l | 290000 | |
| m | 100000 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1820000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| , | 580000 | |
| x | 560000 | |
| s | 290000 | |
| l | 290000 | |
| m | 100000 | 5.5% |
gender
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| women | |
|---|---|
| kids | |
| unisex | |
| men |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.8 |
| Min length | 3 |
Characters and Unicode
| Total characters | 480000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | women |
|---|---|
| 2nd row | women |
| 3rd row | women |
| 4th row | kids |
| 5th row | women |
Common Values
| Value | Count | Frequency (%) |
| women | 70000 | |
| kids | 10000 | 10.0% |
| unisex | 10000 | 10.0% |
| men | 10000 | 10.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| women | 70000 | |
| kids | 10000 | 10.0% |
| unisex | 10000 | 10.0% |
| men | 10000 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 90000 | |
| n | 90000 | |
| m | 80000 | |
| w | 70000 | |
| o | 70000 | |
| i | 20000 | 4.2% |
| s | 20000 | 4.2% |
| k | 10000 | 2.1% |
| d | 10000 | 2.1% |
| u | 10000 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 480000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 90000 | |
| n | 90000 | |
| m | 80000 | |
| w | 70000 | |
| o | 70000 | |
| i | 20000 | 4.2% |
| s | 20000 | 4.2% |
| k | 10000 | 2.1% |
| d | 10000 | 2.1% |
| u | 10000 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 480000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 90000 | |
| n | 90000 | |
| m | 80000 | |
| w | 70000 | |
| o | 70000 | |
| i | 20000 | 4.2% |
| s | 20000 | 4.2% |
| k | 10000 | 2.1% |
| d | 10000 | 2.1% |
| u | 10000 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 480000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 90000 | |
| n | 90000 | |
| m | 80000 | |
| w | 70000 | |
| o | 70000 | |
| i | 20000 | 4.2% |
| s | 20000 | 4.2% |
| k | 10000 | 2.1% |
| d | 10000 | 2.1% |
| u | 10000 | 2.1% |
rgb_r_main_col
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 161.4 |
| Minimum | 79 |
|---|---|
| Maximum | 205 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 79 |
|---|---|
| 5-th percentile | 79 |
| Q1 | 138 |
| median | 160 |
| Q3 | 205 |
| 95-th percentile | 205 |
| Maximum | 205 |
| Range | 126 |
| Interquartile range (IQR) | 67 |
Descriptive statistics
| Standard deviation | 39.790147 |
|---|---|
| Coefficient of variation (CV) | 0.24653127 |
| Kurtosis | -0.65105527 |
| Mean | 161.4 |
| Median Absolute Deviation (MAD) | 26.5 |
| Skewness | -0.53681348 |
| Sum | 16140000 |
| Variance | 1583.2558 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 205 | 30000 | |
| 139 | 20000 | |
| 188 | 10000 | 10.0% |
| 138 | 10000 | 10.0% |
| 79 | 10000 | 10.0% |
| 135 | 10000 | 10.0% |
| 181 | 10000 | 10.0% |
| Value | Count | Frequency (%) |
| 79 | 10000 | 10.0% |
| 135 | 10000 | 10.0% |
| 138 | 10000 | 10.0% |
| 139 | 20000 | |
| 181 | 10000 | 10.0% |
| 188 | 10000 | 10.0% |
| 205 | 30000 |
| Value | Count | Frequency (%) |
| 205 | 30000 | |
| 188 | 10000 | 10.0% |
| 181 | 10000 | 10.0% |
| 139 | 20000 | |
| 138 | 10000 | 10.0% |
| 135 | 10000 | 10.0% |
| 79 | 10000 | 10.0% |
rgb_g_main_col
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 139.6 |
| Minimum | 26 |
|---|---|
| Maximum | 238 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 26 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 104 |
| median | 144 |
| Q3 | 181 |
| 95-th percentile | 238 |
| Maximum | 238 |
| Range | 212 |
| Interquartile range (IQR) | 77 |
Descriptive statistics
| Standard deviation | 63.641814 |
|---|---|
| Coefficient of variation (CV) | 0.45588692 |
| Kurtosis | -0.72863911 |
| Mean | 139.6 |
| Median Absolute Deviation (MAD) | 38.5 |
| Skewness | -0.41055271 |
| Sum | 13960000 |
| Variance | 4050.2805 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 104 | 10000 | |
| 238 | 10000 | |
| 173 | 10000 | |
| 140 | 10000 | |
| 43 | 10000 | |
| 148 | 10000 | |
| 26 | 10000 | |
| 206 | 10000 | |
| 181 | 10000 | |
| 137 | 10000 |
| Value | Count | Frequency (%) |
| 26 | 10000 | |
| 43 | 10000 | |
| 104 | 10000 | |
| 137 | 10000 | |
| 140 | 10000 | |
| 148 | 10000 | |
| 173 | 10000 | |
| 181 | 10000 | |
| 206 | 10000 | |
| 238 | 10000 |
| Value | Count | Frequency (%) |
| 238 | 10000 | |
| 206 | 10000 | |
| 181 | 10000 | |
| 173 | 10000 | |
| 148 | 10000 | |
| 140 | 10000 | |
| 137 | 10000 | |
| 104 | 10000 | |
| 43 | 10000 | |
| 26 | 10000 |
rgb_b_main_col
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 133.5 |
| Minimum | 0 |
|---|---|
| Maximum | 250 |
| Zeros | 10000 |
| Zeros (%) | 10.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 781.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 57 |
| median | 143 |
| Q3 | 205 |
| 95-th percentile | 250 |
| Maximum | 250 |
| Range | 250 |
| Interquartile range (IQR) | 148 |
Descriptive statistics
| Standard deviation | 81.148727 |
|---|---|
| Coefficient of variation (CV) | 0.60785563 |
| Kurtosis | -1.2130238 |
| Mean | 133.5 |
| Median Absolute Deviation (MAD) | 72.5 |
| Skewness | -0.23314943 |
| Sum | 13350000 |
| Variance | 6585.1159 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 57 | 10000 | |
| 104 | 10000 | |
| 0 | 10000 | |
| 149 | 10000 | |
| 226 | 10000 | |
| 205 | 10000 | |
| 26 | 10000 | |
| 250 | 10000 | |
| 181 | 10000 | |
| 137 | 10000 |
| Value | Count | Frequency (%) |
| 0 | 10000 | |
| 26 | 10000 | |
| 57 | 10000 | |
| 104 | 10000 | |
| 137 | 10000 | |
| 149 | 10000 | |
| 181 | 10000 | |
| 205 | 10000 | |
| 226 | 10000 | |
| 250 | 10000 |
| Value | Count | Frequency (%) |
| 250 | 10000 | |
| 226 | 10000 | |
| 205 | 10000 | |
| 181 | 10000 | |
| 149 | 10000 | |
| 137 | 10000 | |
| 104 | 10000 | |
| 57 | 10000 | |
| 26 | 10000 | |
| 0 | 10000 |
rgb_r_sec_col
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| 205 | |
|---|---|
| 255 | |
| 164 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 255 |
|---|---|
| 2nd row | 255 |
| 3rd row | 255 |
| 4th row | 164 |
| 5th row | 164 |
Common Values
| Value | Count | Frequency (%) |
| 205 | 40000 | |
| 255 | 30000 | |
| 164 | 30000 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 205 | 40000 | |
| 255 | 30000 | |
| 164 | 30000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 100000 | |
| 2 | 70000 | |
| 0 | 40000 | 13.3% |
| 1 | 30000 | 10.0% |
| 6 | 30000 | 10.0% |
| 4 | 30000 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5 | 100000 | |
| 2 | 70000 | |
| 0 | 40000 | 13.3% |
| 1 | 30000 | 10.0% |
| 6 | 30000 | 10.0% |
| 4 | 30000 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5 | 100000 | |
| 2 | 70000 | |
| 0 | 40000 | 13.3% |
| 1 | 30000 | 10.0% |
| 6 | 30000 | 10.0% |
| 4 | 30000 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5 | 100000 | |
| 2 | 70000 | |
| 0 | 40000 | 13.3% |
| 1 | 30000 | 10.0% |
| 6 | 30000 | 10.0% |
| 4 | 30000 | 10.0% |
rgb_g_sec_col
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| 155 | |
|---|---|
| 187 | |
| 211 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 187 |
|---|---|
| 2nd row | 187 |
| 3rd row | 187 |
| 4th row | 211 |
| 5th row | 211 |
Common Values
| Value | Count | Frequency (%) |
| 155 | 40000 | |
| 187 | 30000 | |
| 211 | 30000 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 155 | 40000 | |
| 187 | 30000 | |
| 211 | 30000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 130000 | |
| 5 | 80000 | |
| 8 | 30000 | 10.0% |
| 7 | 30000 | 10.0% |
| 2 | 30000 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 130000 | |
| 5 | 80000 | |
| 8 | 30000 | 10.0% |
| 7 | 30000 | 10.0% |
| 2 | 30000 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 130000 | |
| 5 | 80000 | |
| 8 | 30000 | 10.0% |
| 7 | 30000 | 10.0% |
| 2 | 30000 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 130000 | |
| 5 | 80000 | |
| 8 | 30000 | 10.0% |
| 7 | 30000 | 10.0% |
| 2 | 30000 | 10.0% |
rgb_b_sec_col
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| 155 | |
|---|---|
| 255 | |
| 238 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 255 |
|---|---|
| 2nd row | 255 |
| 3rd row | 255 |
| 4th row | 238 |
| 5th row | 238 |
Common Values
| Value | Count | Frequency (%) |
| 155 | 40000 | |
| 255 | 30000 | |
| 238 | 30000 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 155 | 40000 | |
| 255 | 30000 | |
| 238 | 30000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 140000 | |
| 2 | 60000 | |
| 1 | 40000 | 13.3% |
| 3 | 30000 | 10.0% |
| 8 | 30000 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5 | 140000 | |
| 2 | 60000 | |
| 1 | 40000 | 13.3% |
| 3 | 30000 | 10.0% |
| 8 | 30000 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5 | 140000 | |
| 2 | 60000 | |
| 1 | 40000 | 13.3% |
| 3 | 30000 | 10.0% |
| 8 | 30000 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 300000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5 | 140000 | |
| 2 | 60000 | |
| 1 | 40000 | 13.3% |
| 3 | 30000 | 10.0% |
| 8 | 30000 | 10.0% |
label
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.4 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 100000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 86072 | |
| 1 | 13928 | 13.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 86072 | |
| 1 | 13928 | 13.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 86072 | |
| 1 | 13928 | 13.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 86072 | |
| 1 | 13928 | 13.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 86072 | |
| 1 | 13928 | 13.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 86072 | |
| 1 | 13928 | 13.9% |
| article.1 | category | cost | country | current_price | customer_id | gender | label | productgroup | promo1 | promo2 | ratio | regular_price | rgb_b_main_col | rgb_b_sec_col | rgb_g_main_col | rgb_g_sec_col | rgb_r_main_col | rgb_r_sec_col | sales | sizes | style | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| article.1 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.003 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 |
| category | 1.000 | 1.000 | 0.699 | 0.000 | 0.000 | 0.000 | 0.690 | 0.005 | 0.638 | 0.000 | 0.000 | 0.000 | 0.000 | 0.837 | 0.795 | 0.820 | 0.795 | 0.674 | 0.795 | 0.000 | 0.667 | 0.589 |
| cost | 1.000 | 0.699 | 1.000 | 0.000 | 0.000 | 0.000 | 0.724 | 0.000 | 0.816 | 0.000 | 0.000 | 0.000 | 0.000 | -0.091 | 0.782 | -0.818 | 0.782 | 0.012 | 0.782 | 0.000 | 1.000 | 0.876 |
| country | 0.000 | 0.000 | 0.000 | 1.000 | 0.076 | 0.918 | 0.000 | 0.010 | 0.000 | 0.008 | 0.164 | 0.043 | 0.175 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000 |
| current_price | 0.000 | 0.000 | 0.000 | 0.076 | 1.000 | 0.000 | 0.000 | 0.181 | 0.000 | 0.069 | 0.029 | 0.372 | 0.885 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | -0.178 | 0.000 | 0.000 |
| customer_id | 0.000 | 0.000 | 0.000 | 0.918 | 0.000 | 1.000 | 0.000 | 0.016 | 0.000 | 0.018 | 0.144 | 0.009 | -0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.004 | 0.000 | 0.000 |
| gender | 1.000 | 0.690 | 0.724 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.309 | 0.000 | 0.000 | 0.000 | 0.000 | 0.816 | 0.546 | 0.577 | 0.546 | 0.445 | 0.546 | 0.000 | 1.000 | 0.483 |
| label | 0.003 | 0.005 | 0.000 | 0.010 | 0.181 | 0.016 | 0.000 | 1.000 | 0.000 | 0.064 | 0.020 | 0.462 | 0.022 | 0.000 | 0.000 | 0.005 | 0.000 | 0.000 | 0.000 | 0.099 | 0.000 | 0.000 |
| productgroup | 1.000 | 0.638 | 0.816 | 0.000 | 0.000 | 0.000 | 0.309 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.553 | 0.861 | 0.553 | 0.776 | 0.553 | 0.000 | 0.272 | 0.494 |
| promo1 | 0.000 | 0.000 | 0.000 | 0.008 | 0.069 | 0.018 | 0.000 | 0.064 | 0.000 | 1.000 | 0.047 | 0.162 | 0.016 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.126 | 0.000 | 0.000 |
| promo2 | 0.000 | 0.000 | 0.000 | 0.164 | 0.029 | 0.144 | 0.000 | 0.020 | 0.000 | 0.047 | 1.000 | 0.044 | 0.028 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.016 | 0.000 | 0.000 |
| ratio | 0.000 | 0.000 | 0.000 | 0.043 | 0.372 | 0.009 | 0.000 | 0.462 | 0.000 | 0.162 | 0.044 | 1.000 | -0.071 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | -0.435 | 0.000 | 0.000 |
| regular_price | 0.000 | 0.000 | 0.000 | 0.175 | 0.885 | -0.000 | 0.000 | 0.022 | 0.000 | 0.016 | 0.028 | -0.071 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.011 | 0.000 | 0.000 |
| rgb_b_main_col | 1.000 | 0.837 | -0.091 | 0.000 | 0.000 | 0.000 | 0.816 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.842 | 0.212 | 0.842 | -0.689 | 0.842 | 0.000 | 1.000 | 0.931 |
| rgb_b_sec_col | 1.000 | 0.795 | 0.782 | 0.000 | 0.000 | 0.000 | 0.546 | 0.000 | 0.553 | 0.000 | 0.000 | 0.000 | 0.000 | 0.842 | 1.000 | 0.812 | 1.000 | 0.643 | 1.000 | 0.000 | 0.408 | 0.400 |
| rgb_g_main_col | 1.000 | 0.820 | -0.818 | 0.000 | 0.000 | 0.000 | 0.577 | 0.005 | 0.861 | 0.000 | 0.000 | 0.000 | 0.000 | 0.212 | 0.812 | 1.000 | 0.812 | 0.037 | 0.812 | 0.000 | 0.667 | 0.830 |
| rgb_g_sec_col | 1.000 | 0.795 | 0.782 | 0.000 | 0.000 | 0.000 | 0.546 | 0.000 | 0.553 | 0.000 | 0.000 | 0.000 | 0.000 | 0.842 | 1.000 | 0.812 | 1.000 | 0.643 | 1.000 | 0.000 | 0.408 | 0.400 |
| rgb_r_main_col | 1.000 | 0.674 | 0.012 | 0.000 | 0.000 | 0.000 | 0.445 | 0.000 | 0.776 | 0.000 | 0.000 | 0.000 | 0.000 | -0.689 | 0.643 | 0.037 | 0.643 | 1.000 | 0.643 | 0.000 | 0.408 | 0.570 |
| rgb_r_sec_col | 1.000 | 0.795 | 0.782 | 0.000 | 0.000 | 0.000 | 0.546 | 0.000 | 0.553 | 0.000 | 0.000 | 0.000 | 0.000 | 0.842 | 1.000 | 0.812 | 1.000 | 0.643 | 1.000 | 0.000 | 0.408 | 0.400 |
| sales | 0.000 | 0.000 | 0.000 | 0.026 | -0.178 | 0.004 | 0.000 | 0.099 | 0.000 | 0.126 | 0.016 | -0.435 | 0.011 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| sizes | 1.000 | 0.667 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.272 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.408 | 0.667 | 0.408 | 0.408 | 0.408 | 0.000 | 1.000 | 0.509 |
| style | 1.000 | 0.589 | 0.876 | 0.000 | 0.000 | 0.000 | 0.483 | 0.000 | 0.494 | 0.000 | 0.000 | 0.000 | 0.000 | 0.931 | 0.400 | 0.830 | 0.400 | 0.570 | 0.400 | 0.000 | 0.509 | 1.000 |
| country | article | sales | regular_price | current_price | ratio | retailweek | promo1 | promo2 | customer_id | article.1 | productgroup | category | cost | style | sizes | gender | rgb_r_main_col | rgb_g_main_col | rgb_b_main_col | rgb_r_sec_col | rgb_g_sec_col | rgb_b_sec_col | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | OC6355 | SHOES | TRAINING | 13.29 | slim | xxs,xs,s,m,l,xl,xxl | women | 205 | 104 | 57 | 255 | 187 | 255 | 0 |
| 1 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | AP5568 | SHORTS | TRAINING | 2.29 | regular | xxs,xs,s,m,l,xl,xxl | women | 188 | 238 | 104 | 255 | 187 | 255 | 0 |
| 2 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | CB8861 | HARDWARE ACCESSORIES | GOLF | 1.70 | regular | xxs,xs,s,m,l,xl,xxl | women | 205 | 173 | 0 | 255 | 187 | 255 | 0 |
| 3 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | LI3529 | SHOES | RUNNING | 9.00 | regular | xxs,xs,s,m,l,xl,xxl | kids | 205 | 140 | 149 | 164 | 211 | 238 | 0 |
| 4 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | GG8661 | SHOES | RELAX CASUAL | 9.60 | regular | xxs,xs,s,m,l,xl,xxl | women | 138 | 43 | 226 | 164 | 211 | 238 | 0 |
| 5 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | TX1463 | SWEATSHIRTS | TRAINING | 4.20 | wide | xxs,xs,s,m,l,xl,xxl | women | 79 | 148 | 205 | 164 | 211 | 238 | 1 |
| 6 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | PC6383 | SHOES | FOOTBALL GENERIC | 9.90 | wide | xs,s,m,l,xl | unisex | 139 | 26 | 26 | 205 | 155 | 155 | 0 |
| 7 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | VT7698 | SHOES | INDOOR | 5.20 | wide | xxs,xs,s,m,l,xl,xxl | women | 135 | 206 | 250 | 205 | 155 | 155 | 1 |
| 8 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | FG2965 | HARDWARE ACCESSORIES | RUNNING | 1.29 | slim | xxs,xs,s,m,l,xl,xxl | women | 181 | 181 | 181 | 205 | 155 | 155 | 0 |
| 9 | Germany | YN8639 | 28 | 5.95 | 3.95 | 0.663866 | 2016-03-27 | 0 | 0 | 1003.0 | AC7347 | SHOES | FOOTBALL GENERIC | 8.70 | regular | xxs,xs,s,m,l,xl,xxl | men | 139 | 137 | 137 | 205 | 155 | 155 | 1 |
| country | article | sales | regular_price | current_price | ratio | retailweek | promo1 | promo2 | customer_id | article.1 | productgroup | category | cost | style | sizes | gender | rgb_r_main_col | rgb_g_main_col | rgb_b_main_col | rgb_r_sec_col | rgb_g_sec_col | rgb_b_sec_col | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99990 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | OC6355 | SHOES | TRAINING | 13.29 | slim | xxs,xs,s,m,l,xl,xxl | women | 205 | 104 | 57 | 255 | 187 | 255 | 0 |
| 99991 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | AP5568 | SHORTS | TRAINING | 2.29 | regular | xxs,xs,s,m,l,xl,xxl | women | 188 | 238 | 104 | 255 | 187 | 255 | 0 |
| 99992 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | CB8861 | HARDWARE ACCESSORIES | GOLF | 1.70 | regular | xxs,xs,s,m,l,xl,xxl | women | 205 | 173 | 0 | 255 | 187 | 255 | 0 |
| 99993 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | LI3529 | SHOES | RUNNING | 9.00 | regular | xxs,xs,s,m,l,xl,xxl | kids | 205 | 140 | 149 | 164 | 211 | 238 | 0 |
| 99994 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | GG8661 | SHOES | RELAX CASUAL | 9.60 | regular | xxs,xs,s,m,l,xl,xxl | women | 138 | 43 | 226 | 164 | 211 | 238 | 0 |
| 99995 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | TX1463 | SWEATSHIRTS | TRAINING | 4.20 | wide | xxs,xs,s,m,l,xl,xxl | women | 79 | 148 | 205 | 164 | 211 | 238 | 0 |
| 99996 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | PC6383 | SHOES | FOOTBALL GENERIC | 9.90 | wide | xs,s,m,l,xl | unisex | 139 | 26 | 26 | 205 | 155 | 155 | 0 |
| 99997 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | VT7698 | SHOES | INDOOR | 5.20 | wide | xxs,xs,s,m,l,xl,xxl | women | 135 | 206 | 250 | 205 | 155 | 155 | 0 |
| 99998 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | FG2965 | HARDWARE ACCESSORIES | RUNNING | 1.29 | slim | xxs,xs,s,m,l,xl,xxl | women | 181 | 181 | 181 | 205 | 155 | 155 | 0 |
| 99999 | Germany | PW6278 | 227 | 57.95 | 26.95 | 0.465056 | 2016-06-26 | 0 | 0 | 1489.0 | AC7347 | SHOES | FOOTBALL GENERIC | 8.70 | regular | xxs,xs,s,m,l,xl,xxl | men | 139 | 137 | 137 | 205 | 155 | 155 | 0 |